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Abstract 

In 1946, Behrend gave a construction of dense finite sets of in- 
tegers that do not contain a 3-term arithmetic progression (AP). In 
1961, Rankin generalized Behrend's construction to sets avoiding k- 
term APs. In 2008, Elkin refined Behrend's 3-term construction, and 
later in 2008, Green & Wolf found a distinct approach (albeit morally 
similar) that is technically more straightforward. This work combines 
Elkin's refinement and Rankin's generalization in the Green &: Wolf 
framework. A curious aspect of the construction is that we induct 
through sets that do not contain a long polynomial progression in 
order to construct a set without a long AP. The bounds for rk{N), 
the largest size of a subset of {1,2, .. . ,N} that does not contain a 
k element AP, are (where log = log2, for sufficiently large N, with 
n = [log k] ) 

^21ogiV 

4^2 log ' 

rk{N) > Ar2-"2'"-^'/''»^+2^iogiogiv^ 



The improvement over earlier work is in the simplification of the con- 
struction, the explicitness of the bound for r^, and in the log log term 
for general k. 

We denote by rfc(iV) the maximum possible size of a subset of {1, 2, ... , N} 
that does not contain k numbers in arithmetic progression. Behrend [1] 
proved that 

^3(iV) ^ ^ 1 



where log is the base-2 logarithm and each occurrence of C is a new positive 
constant. Sixty years later, Elkin |3] strengthened this to show that there 
are arbitrarily large satisfying 

rsjN) ^^ (logiV)V4 

and shortly afterwards Green & Wolf ^ arrived at the same bound by a 
different method. For A; > 1 + 2"~^, Rankin [TT] proved that for each e > 0, 
if is sufficiently large then 

rkjN) ^ 1 

where n = [log A;]. This was subsequently rediscovered in a simpler, but 
less precise, form by Laba & Lacey [9J. Together with the obvious rk{N) < 
rk+i{N), these are asymptotically the thickest known constructions. The 
primary interest in the current work is the following corollary of our main 
theorem. 

Corollary 1. Set n = \\ogk]. There exists a positive constant C such that 
for allN >1 

For every e > 0, if N is sufficiently large then 

N ~ \ e7r3/2 ^1 22v'2T^" 

Szemeredi's Theorem states that rfc(iV) = o{N), and the task of getting 
quantitative upper bounds on rk{N) has been mathematically fruitful. The 
currently-best upper bounds on rk{N) are due to Bourgain |2], Green & 
Tao [6j, and Gowers [5], respectively: 

rs{N) < Ar(loglogAr)2(logAr)-2/3. 

rfc(iV) <Ar(log log AT) -2 

It is natural to speculate as to whether the upper or lower bound on 
rk{N) is closer to the truth. Certainly, the upper bounds have seen a steady 
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stream of substantive improvements, while the main term of the lower bound 
has remained unchanged for 50 years. The reader is directed to a discussion 
on Gil Kalai's blog [8] for some relevant speculative remarks of Gowers and 
of Kalai's. 

To prove our result we need to induct through sets that do not contain 
more elaborate types of progressions. A fc-term D-progression is a sequence 
of the form 

Q(l),Q(2),...,g(A;) 

where Q is a nonconstant polynomial with degree at most D. For example, 
1-progressions are proper arithmetic progressions. The sequences 2, 1, 2, 5, 10 
and 1, 2, 4, 7, 11 are 5-term 2-progressions arising from the polynomials (j — 
2)^ + 1 and (j^ — j + 2)/2. In particular, a progression of integers may 
contain the same number in different places, and may arise from a polynomial 
whose coefficients are not integers. Also, note that the class of fc-term D- 
progressions is invariant under both translation and dilation. Let Vk^oiN) 
denote the maximum possible size of a subset of [1, iV] fl Z that does not 
contain any A;-term D-progressions. 

Theorem 1. Fix positive integers k, D and set n = [log(A;/-D)] . There exists 
a positive constant C such that for every N 

To explain what is new and interesting in the current work, we begin 
by summarizing the earlier constructions. Behrend's construction [1], while 
no longer the numerically best or most general, remains the most elegant. 
His initial observation is that a sphere cannot contain a 3-term arithmetic 
progression simply because a line and a sphere cannot intersect more than 
twice. Let S" be a set of points in all lying on one sphere and having 
all coordinates positive and smaller than P, and then let A be the image of 
S under the map Lp : (xi, . . . i— )■ a;i(2P)*~^. Because < Xj < P, 
addition of two elements of A will not involve any carrying. This is therefore 
a Freiman 2-isomorphism between S and A] that is, afi + ^2 = + 0:4 if and 
only if fpixi) + v^(x2) = '/'(xa) + (pixi). Since three integers a < b < c are 
in arithmetic progression if and only if a + c = b + b, this proves that A is 
free of 3-term arithmetic progressions. The only remaining work is to show 
that there exists a suitably large S, which Behrend did with the pigeonhole 
principle, and to optimize P and d in terms of A^. 
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Rankin combined three observations. His first observation was that Behrend's 
use of the pigeonhole principle could be replaced with a number-theoretic re- 
sult on the number of representations of a huge number as a sum of a large 
number of squares. The second is that a degree D polynomial cannot inter- 
sect a sphere in more than 2D points, and so Behrend's argument actually 
gives a lower bound on r2D+i,D- The third is that one can use a set that 
does not contain fc-term 2D-progressions to build as a union of concentric 
spheres with skillfully chosen radii. The corresponding set A (after mapping 
S as per Behrend, but with the radix 2P replaced by something much larger) 
will necessarily be free of A;-term D-progressions. This provided for an in- 
ductive bound. For example, rg = rg i is bounded in terms of rg 2, which is 
bounded in terms of rg 4, which is then bounded using Rankin's generalized 
Behrend argument. 

Elkin [3] improved Behrend's 3-term construction in two ways. First, he 
used the central limit theorem (and the pigeonhole principle) to guarantee 
the existence of a large 5; and second, he considered lattice points in a very 
thin annulus. Using an annulus instead of a sphere leads to a set S that is 
substantively larger but, unfortunately, does have 3-term arithmetic progres- 
sions. After removing a small number of points to eliminate the progressions, 
Elkin proceeded along the same line as Behrend, needing to optimize d, P, 
and also the thickness of the annulus. 

Green & Wolf [7] recast Elkin's argument in a way that avoids counting 
lattice points. In the ci- dimensional torus, they take S to be the intersection 
of a small box and an annulus. Using random elements uJ, a of the torus, they 
consider the map 93 : n 1— )■ nU+a. Letting A := {a: (p{a) G S}, this map 
is a Freiman 2-isomorphism between A and ^p{A). The randomness allowed 
them to easily count the size of A and the number of progressions in A that 
need to be removed. 

In the current work we recast Rankin's argument using the lessons of Elkin 
and Green & Wolf. We avoid Rankin's sum-of-squares number theory lemma 
by taking random uJ,a (we still need the pigeonhole principle, however). We 
find the right generalization of "an arithmetic progression in a thin annulus 
has a small difference" to D-progressions, and thereby generalize Elkin's 
result to improve Rankin's bound on r2D+i.D- Finally, by taking concentric 
annuli, we smooth out Rankin's inductive step. We note also that previous 
work has sometimes sufferecQ from a cavalier treatment of error terms. For 

^Some would say benefitted. 
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example, Elkin's "arbitrarily large A^" and Rankin's "1 + e" term can be 
eliminated with a little care. We have taken the opposite tack here, in places 
working for coefficients that are not important in the final analysis, but which 
we consider to be of interest. In particular, the refinement for stated in 
Corollary [1] constitutes about 15% (by volume) of this work. 

1 Notation 

Throughout, log and exp refer to the base-2 logarithm and exponential. Vec- 
tors are all given overlines, as in x, and all have dimension d. 

The parameters and d tend to infinity together, with much larger 
than d, and all little-oh notation is with respect to N and d. The parameter 
d is a dimension, and must be an integer, while N need not be an integer. 
The other fundamental parameters, the integers k and D, are held constant. 

We define the difference operator A to be the map taking a finite sequence 
{ai)i=i to the finite sequence {a^+i — at,)^lj. The formula for repeated differ- 
encing is then 



We note that a nonconstant sequence (aj) with at least D + 1 terms is a 
Z)-progression if and only if A^~^^{ai) is a sequence of zeros. If = p{i), 
with p a polynomial with degree D and lead term p^, then A^(ai) = (DIpd), 
a constant sequence. Note also that A is a linear operator. Finally, we will 
make repeated use of the fact, provable by induction for 1 < n < k, that 



A fc-term type-(?T,, a, h) progression is a nonconstant sequence Oi, 02, . . . , 
with k > n, ai = a, and n-th differences A^{ai) the constant nonzero se- 
quence (b). For example, if p is a degree n polynomial (with lead term 
Pn 7^ 0) and k > n, then p(l), . . . ,p{k) is a type (n,p(l), n!p„) progression. 

The open interval {a — b,a + b) of real numbers is denoted a ± b. The 
interval [1, A^] fl Z of natural numbers is denoted [A^]. For positive integers i, 
the box (±2~*~^)'^, which has Lebesgue measure 2"*^^, is denoted BoXj. We 
define BoXq = [-1/2, 1/2)'^, and define X mod 1 to be the unique element y 
of BoXo with X - y el/. 




A 




max a,; — mm Oj 
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A point X = {Xi, . . . , X(i) chosen uniformly from has components 

Xi independent and uniformly distributed in (—2^-^^^, 2^-^^^). Therefore, 
||a;||2 = '^'i=iXf is the sum of d iidrvs, and is therefore normally distributed 
as (i ^ 00. Further ||a;||| has mean iid '■= 2~'^^d/12 and variance ajj : = 
2-4^d/180. 

For any set A <Z [n], positive integer D, and sufficiently small positive 
real number 5, we define Annuli(A, n, D, 5) in the following manner: 



Annuli(>1, n, D,S) -.^ Ix e Boxd : 



where z & fi^^ao is chosen to maximize the volume of ANNULl(y4, n, D, d). 
Geometrically, Annuli(74, n, 5) is the union of 1^41 spherical shells, inter- 
sected with BoXj). 




2 Lemmas 

The following lemma is best-possible for k — 2D + 1. Improving the bound 

for larger k comes down to the following problem: if Q has degree D and all 
of |Q(1)|, . . . , are less than 1, then how big can the leading coefficient 

of Q be? 

Lemma 1 (Sphere-ish polynomials have small-ish lead coefficients). Let S,r 

he real numbers with < S < r, and let k,D be integers with D > l,k > 
2D + 1. If P{j) is a polynomial with degree D, and r — 6 < ||-P(j)||2 < r + S 
for j G [k] , then the lead coefficient of P has norm at most 2^ {2D)r^''^ 

Proof. In this paragraph we summarize the proof; in subsequent paragraphs 
we provide the details. Q{j) := ||-P(j)||2 — r is a degree 2D polynomial of j, 
and each of the 2D -\- 1 real numbers (5(1), . . . , Q{2D + 1) are close to zero. 
If they were all exactly zero, then Q would have more zeros than its degree 
and so would necessarily be identically zero. Just having that many values 
close to 0, however, is already enough to guarantee that the lead coefficient 
of Q is small. 

Let P(j) = Po + Pij H K PdJ^- We work with the degree 2D poly- 
nomial 

2D 

Qij) ■■= \\m\\l-r = Y.<inr, 

n=0 
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and note in particular that q2D = II-PdIH- As < 5 < r, we conclude that 

\Q{3)\ <J_ 

Set g, Q to be the column vectors (go, gi, • • • , 120)^, (Q(l)i • • • i Qi^D + 
1))^, respectively. Let M be the {2D + 1) x {2D + 1) matrix whose {i,j)- 
component is i^"^. We have the system of equations 

Mq^Q, 

which is nonsingular because M is a Vandermonde matrix. By Cramer's 
rule, the cofactor expansion of a determinant along the last column, and the 
triangle inequality, 

det(A/') 1 ^ E?fi"l%2D+iL 

= di^ = dS(M) g =S Vat(M)| 

By the formula for the determinant of a Vandermonde matrix (the relevant 
minors of M are also Vandermonde matrices), we find that 

IIP l|2 ^T^TjM^ 2^^ 

ll^^ll^ - - |det(M)| - (2D)! ^' 

completing the proof. □ 

Lemma 2 (Tight modular progressions are also non- modular progressions). 
Suppose thatp{j) is a polynomial with degree D, withD-th coefficient po, and 
setXj:=ujp{j)+a modi. If Xi,X2, ■■■ ,Xk are in BoXd and k > D+2, then 

there is a vector polynomial P{j) = Ylii=oPif ^'^^^ ^U) — foT j G [k], 
and D\Pjj = uJ D\pu mod 1. 

Proof. Since p has degree D, the {D + l)-th differences of p{l),p{2), . . . ,p{k) 
are zero, and therefore the (D+l)-th differences of aJi, 2:2, ■ ■ ■ ,Xk are modulo 
1, i.e., all of their components are integers. We will show that in fact all of 
their components are strictly between —1 and 1, and so they must all be 0. 
The {D + l)-th differences are given by (valid only for 1 < v < k — D — 1) 



A^^\x,){v)^J2(^t')^-''^'^^^'- 

i=Q ^ ^ 
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Denote the i-th component of xj by x^j\ As Xy+i E 3oXd, each component 
of x,,+i is in (-2-^-^2--°-^). Thus, the h-th component of A^+^{xi){v) 
satisfies 



1=0 ^ ^ i=0 ^ ^ i=0 ^ ^ 

and therefore ^^^^{xi) = (0). 
Now, 

L>!Pzj = A^(P(i)) = A^(xi) = uD\pD (mod T). 

As P(i) G BoX£) for 1 < i < A;, the above binomial-coefficient triangle- 
inequahty argument tells us that the components of /\.^{P{i)) are between 
-1/2 and 1/2, and so D\Pd = {wDlpo mod T) . □ 

Lemma 3 (Annuli has large volume). If d is sufficiently large, AC [n], 

2 

and 25 < 1/n, then the volume of Annuli{ A, n, D, 5) is at least -2^'^^\A\6. 
Provided that 5 logo? 0, the volume o/ Annuli({1}, 1, D, 5) is at least 

Proof. A uniformly chosen element x = {Xi, . . . ,X(i) of Box^) has the Xj 
independent and each uniformly distributed in (— 2~^~^,2~^~^). Thus ||!f||2 
is the sum of d iidrvs and has mean fiu :— 2~^^d/12 and variance a]-, := 
2~^^d/180. By the central limit theorem (CLT), the random variable 

\\x\\l - hd 



D+\ /„ D+1 



has a normal distribution, as ci ^ oo, with mean and variance 1. We would 
like to argue that 



S g-.V2 



vol Annuli({1}, 1, D, 6) > 2-'^^ ( / —=dx > 2"'='^ 25- 

'27r / \ VStt 




but wc cannot apply the CLT to an interval that is shrinking as rapidly as 
±6. We get around this by applying the CLT to an interval that shrinks 
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very slowly, and then using an analytic form of the pigeonhole principle to 
guarantee an appropriately short subinterval with the needed density. 

We could accomplish this using only the classical CLT, but it is expedi- 
tious to use the quantitative CLT known as the Berry-Esseen theorem [U Sec- 
tion XVI. 5], which is applicable since 

Pd := E [\Xf - 2"2^/12|3] = 2-^^(3 + 2y3)/11340 < oo. 

Let / be an interval whose endpoints depend on d. The Berry-Esseen theorem 
implies that 



P 



|a;||2 /i 

0"D 



e I 



> 



1 



exp{-x^/2) dx-2 



PD 



First we handle the case A = {l},n = 1. We have 



P 



FII2 ^ pD 



log d 



> 



> 



> 



1 



l/logd 

exp(— x^/2) dx — 2 -= — -= 

271 J -1/ log d {aD/\/d)^Vd 

^ ^ exp(-(l/logd)V2) ^ 



2tt logrf 



Vd 



V2/7r 
logfi 



(l-i(logd)-^-3(logrf)d-i/2^ 



Let / be the density function of ^ , and let xi be the indicator function 
of /. Since the convolution 

(/x±i/iogd) * X±s 
is supported on ±{l/\ogd + 6) and has 1-norm 



Il/X±i/ 



losdj 1 



IX±<5||l > 



log d 



(l-(logrf)-^) 25, 
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there must be some z with 



((/x±i/iogd) * X±&){z) > 



,0-^(1 -(logc/n) 25 



2/logrf+ 25 
1 + 5 log 



Consequently, vol Annuli({1}, 1, £), 5) > (^y^-o(l)j 2""^^ 5. 

Similar calisthenics make the following heuristic argument rigorous. Let 
G be a normal rv with mean and variance 1: 



vo1(Annuli(A, n, D, S)) 2-'^" Pi^,s 



>2-'^^P^,^ [Gg (-1,-1 + 25|A|)] 
1 



1 /.-1+25|A| 

2-^^^^ / exp(-xV2)da; 



> 2-^^^exp(-l/2)25|v4| 

V 27r 

> -2-'^^|v4|5, 
5 

where we have used 25 < 1/n to force the intervals —(a — ± 5 to be 
disjoint, and also to force —1 + 25|A| < 0. Since the final inequality is strict, 
we can replace the limit in the central limit theorem with a "sufficiently large 
c?" hypothesis. □ 

Lemma m is not best possible. However, the factor 2^"*"^ will turn out to 
be irrelevant in the final analysis. 

Lemma 4 (There are not many types of progressions). Assume k > D. 
There are fewer than 2^^^N'^ types of k-term progressions with degree at 
most D contained in [N] . 

Proof. We suppose that we have a fc-term progression oi, . . . , contained 
in [A^] of type {D',a,b), and find restrictions on D',a and b. First, fix D'. 
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There are clearly at most N possibilities for a. It is straightforward to prove 
by induction that for £ e {!,...,£)'} 




3 A base case and an inductive step 



Proposition 1 (Base Case). If k > 2D, then as N ^ oo 




(1) 



Proposition 2 (Inductive Step). If k > 2D, then there exists a positive 
constant C 



Let Aq be a subset of [Nq\ with cardinality rk,2D{No) that does not contain 
any /c-term 2D-progrcssion, assume 25Nq < 2~'^^, and let 

A := A{uJ,a) = {n e [N] : nuJ + a modi e A^NVLl{Ao, Nq, D,6)}, 

which we will show is typically (with respect to uJ, a being chosen uniformly 
from BoXo) a set with many elements and few types of /^-progressions. After 
removing one element from A for each type of progression it contains, we will 
be left with a set that has large size and no fc-term D-progressions. Since 
BoXq X BoXq has Lebesgue measure 1, this argument could be easily recast 
in terms of Lebesgue integrals, but we prefer the probabilistic notation and 
language. 

Define T :— T{uj,a) to be the set 



N 



> C2 



No 



where 





3b G M, D' e [D] such that A{uj, a.) contains 
a k-term progression of type {D', a, h) 
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which is contained in A{uJ,a). Observe that A\T is a subset of [A^] and 
contains no fc-term D-progressions, and consequently rk^D{N) > \A\T\ = 
\A\ — \T\ for every uj,a. In particular, 

TkAN) > [\A\ - \T\] = E^,^ [\A\] - E^,s [\T\] . (2) 

First, we note that 

N N 

Ez^,5[|A|] = eA] = J2^T,[neA] = AT vo1(Annuli(Ao, ATq, D, 5)). 

n=l n=l 

Let E{D',a,b) be 1 if A contains a fc-term progression of type {D',a,b), 
and E{D', a,b) = otherwise. We have 

|T| < E{D',a,b), 

{D',a,b) 

where the sum extends over all types {D', a, b) for which D' G [D] and there 
is a D'-progression of that type contained in [A^]; by Lemma H] there are fewer 
than 2'^+^A^^ such types. 

Suppose that A has a fc-term progression of type {D', a, b), with D' G [D]. 
Let p be a degree D' polynomial with lead term p^' 7^ 0, . . . ,p{k) a D'- 
progression contained in A, and — (^)- Then 

Xi := p{i) uJ + a mod T G Annuli(Ao, A^o, D, 6) C BoXd . 

By Lemma |21 the Xj are a D'-progression in M*^, say P{j) = Xl^o 

P{j) = Xj and D'\P]jr = D'lpo'uJ mod 1 = buJ mod 1. By elementary alge- 

is a degree 2D' polynomial in j, and since -P(j) = Xj G Annuli(Ao, A'q, -D, S) 
for j G [/c] , we know that 



aeAo ^ " / 



for all j G [/c], and also (5(1), . . . , Q{k) is a 2Z}'-progression. Define the real 
numbers aj G Aq, ej G ±5 by 

12 



We need to handle two cases separately: either the sequence (aj) is con- 
stant or it is not. Suppose first that it is not constant. Since Oj G Aq, a set 
without fc-term 2D-progressions, we know that A^'^+^(aj) 7^ (0), and since 
(oj) is a sequence of integers, for some v 



A2^+^(a,)(i;)|>l. 



Consider: 



whence 



(0) = A2^+^(Q(0) = ^A^^^^(a.) + A2^+^(6,), 



|A^^+^(Q)(.)| = ^|A^^+^(a.)(.)|> ^ 



iVo' ^ " - No 

Since |ej| < 6, we find that |A^'^+^(ej)(f )| < 2^'^+^5, and since we assumed 
that 26No < 2""^^, we arrive at the impossibility 

^ < |A^^+^(e.)(.)| < 2^^+^5 < ■ = 7^- 

VVq iVo iVQ 

Now assume that (oj) is a constant sequence, say a := Oj, so that 
for all j G [A;] . This translates to 

Il^(j)ll2 & f^D-{z- ^^-r7^)(TD ± S(Td. 

iVo 

Using Lemma [H the lead coefficient P/j/ of P{j) satisfies 

/4D ^,2x1/2 

^ Vmi ) ^^^^ = 

where F := 4^/(2^^). We have deduced that a,b) = 1 only if 



a a; 



+ a mod 1 G ANNULl(y4o, A^o, -D, 5) and \\bco mod 1||2 < y^FanS. 
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Since a is chosen uniformly from BoXq, we notice that 
Pa [acJ+ a mod 1 G ANNULi(ylo, A^o, -D, 5)] = vol ANNULi(ylo, A^o, -D, 5), 



independent of cJ. Also, we notice that the event {||6cJmod 1||2 < VFodE} 
is independent of a, and that since b is an integer, uJ mod 1 and buJ mod 1 
are identically distributed. Therefore, the event {||6aJmod 1||2 < \/Far)6} 
has probability at mostly 

volBALL(v/F^)^ rW2)d ■ 

where Ball(x) is the d-dimensional ball in M°' with radius x. It follows that 

Fzj,^ [E{D',a,b) = 1] < vo1Annuli(Ao, Ao,L',5) ■ vo1Ball(v/f^), 
and so 

^,a[\T\] < 2^+1^2 vol Annuli(Ao, Ao, D, 5) ■ vol Ball( ^^(7^5)- 
Equation ([2]) now gives us 

!^h^^l > vo1(Annuli(Ao, Ao,L',(5)) (l - 2^+^ N voIBall{^/ Fa ^6) 
Setting 

- 1_ ( ^ V^'' r(c//2)^/'^ 3^5 /2m rfV^ 
ttF V(rf + 2)2^+V A^/'^^B ^ evr V^Ja^/'^' 

we observe that 



T{d/2)d d + 2' 



^In fact, since we will shortly choose S so that FgdS — > 0, this upper bound cannot be 
improved. 
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3.1 Finish proof of Proposition [T] 

We set 



d:-- 



2 log AT 
D 



so that 6\ogd — )■ 0, and 



N 



> 



> 



d + 2 
d 

d~+2 
d 

d~+2 
V2 



-oil) 2-"'' 5 



(ID 



d 



2/d 



T{d/2fl'^ 



V2 



TC 



3/2 p 



-oil] 



-oil] 



-dD 



Tid/2) 



2/d 



(l + o(l))rf/2e 



2^"v/360 _ \ VI 



Define the error term e(A^) by 



dD + -log N= VSDhgN + e{N), 

and observe that for any integer i, we have e(x) monotone increasing on 
2(^+1) while N being in that interval gives d = i. By algebra 
e(2^'^/2) = 0, and also 

lim e(A^) = — . 

Af^cxp((d+l)2D/2) d 



It follows that e(A^) < D/iV^i\ogN)/D - 1). 
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From this, we see that 

D 



exp ( - {dD + ^ log A^)) > exp(-A/8DlogiV) exp ^ 



^2{\ogN)/D-l 
1 + o(l)) exp(-v/8DlogAr), 



which completes the proof of Proposition [T] 

3.2 Finish proof of Proposition 

We set 



2D\ \ A^^/d 



3^5 V V^// 

which accomplishes \2~'^^ < 25Nq < 2"^^. With this 5,Nq and Lemma [3] 
we have, 

^^^4^ - ^volANNULi(Ao,iVo,D,5) 

>T^-2-'^^|Ao|(5 
- 5 ' ' 



2 5 A/'o 
C2 



4 Proof of Theorem [T] 

We proceed by induction, with the base case of n = 2 following immediately 
from Proposition [T] We now assume that Theorem [T] holds for n, assume 
that k > 2"'D, and show that 

TkMN) (l0gAr)V(2n+2) 



N - exp {{n + l)2"/2£)™/(n+i) ' 
By Proposition |2l we have 

rfc,D(iV) ^ ^_ 1 r,,2D(iVo) 



iV - 2<^^ No 
16 



with No = CN'^'^d-^'^. Since k > 2"L) = 2"-i(2L)), the inductive hypothesis 
gives us 



TkAN) ^ ^ 1 



(log iVo) 1/(2") 



C 



> c 



exp (n2("-i)/2(2D)("-i)/"</b^) 
(logAro)V(2-) 



exp [dD + n2("-i)/2 (2D) ^ log C - i log + | log N) 
(logiVo)'/^'"^ 



exp (dD + n2(""i)/2(2L))("-i)/"y|logA^) 

with the final inequality coming from d being sufficiently large. 
Setting 

,n/2 / logA^^ 



2n/2 



we arrive at the error term and bound for it: 



rfD+n2("-i)/2 (2/^) (—1)/" ^ log AT = (n+l)2"/2D«/(-+i) (log AT) i/("+i) +e{N) 



where 



+ l)/}(n+2)/(n+l) 

e(Ar) < (1 + 0(1)) / < 



c 



n2"/2+i (log A^) - (logA^)V("+i)' 



Thus, 



exp|^- [dD + n2("-i)/2(2/})(-i)/- j^g^^^ > 

(1 + o(l)) exp(-(n + l)2"/2/}«/(-+i)(iogiv)i/(n+i)) 



5 Further Thoughts 

The approach here works mutatis mutandis for constructing a subset of an 
arbitrary set M oi N integers. The number of progressions in M becomes a 
critical parameter, and the inductive step is somewhat more technical. The 
specific changes are detailed in [TU]. 
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Further, the methods here can serve as a basic outhne for constructing 
thick subsets of a large arbitrary set that does not contain nontrivial solutions 
to a linear system of equations. This problem has seen recent progress due 
to Shapira [12j, but a universal thick construction remains elusive. 
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